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Abstract 

We exhibit a stochastic discrete time model that has exactly the Eigen model [Tas its 
deterministic continuous limit. Such model can be divided into two phases: reproduction, 
followed by neutral selection. This result suggests that Eigen 's model describes the competition 
among individuals differing in terms of fecundity but with the same viability. We explicitly 
write down the Markov matrix of the stochastic model in the two species case and compute the 
master sequence concentration numerically for various values of the total population. Finally 
we compare our results with those obtained with the Eigen model and with the Nowak and 
Schuster birth and death model 

Keywords: Mutation-selection dynamics, Error threshold, Stochastic model. 

1 Introduction 

Eigen's celebrated quasispecies model [T describes the replication of polynucleotides of fixed length 
subject to mutation with the constraint that population size is constant. It consists of a set of 
ODEs, each one governing the time evolution of the number (or of the concentration) of a given 
polynucleotide template. Thus the process is modelized through a set of deterministic equations. 
On the other hand, the process is clearly stochastic, since it involves a mutation probability. The 
deterministic model describe the process exactly only in the limit that the total population considered 
is infinite. This fact was recognized since when the model was first formulated [4J and a series of 
papers jj], [2], [3], [8], [12], have been devoted to extending the model to the case of a finite 
population. However, all these models introduce some approximation, deal with particular subcases 
or introduce some variation of Eigen's hypotheses. In the first part of this paper, we describe a 
new simple discrete time stochastic model that reproduces the Eigen equations in the continuous 
deterministic limit. 

The fact that the approximation of the mutation probability with a mutation rate is exact in the 
infinite population limit will, in general, imply that the differences between the predictions of the 
deterministic model and of a stochastic version of it will become smaller and smaller with increasing 
population size. However, the rate of convergence can depend on the parameters of the model. As 
we will show (see also [3|, for some choices of the mutation probabilities, the rate of convergence is 
zero. Hence it is essential to have a reliable stochastic version of the Eigen model to understand for 
which choices of the mutation probabilities the finite population corrections can play a relevant role. 
In particular, the two species case is very instructive from this point of view. It is simple enough so 
that the stochastic model can be treated (almost) exactly, and, since it involves only two mutation 
probabilities, the results can be plotted on three dimensional graphs; in this way, the differences 
between the Eigen model and the stochastic model can be spotted at a glance. In the second part 
of this paper we explicitly write down the transition matrix for our stochastic model in the two 
species case and we use it to compute the asymptotic master sequence concentration numerically. 
We compare our results with those obtained using the Eigen model and those obtained by Nowak 
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and Schuster [8] through their birth and death model. The results turns out to be extremely close 
even for low values of the total population, for almost all choices of the mutation probabilities. 

2 The Eigen model 

Eigen ^ introduced a model to deal with polynucleotide replication, that we briefly recall here, 
following the exposition given in Swetina and Schuster [10] . The inspiring idea was to compute the 
steady state reached by a population of replicating polynucleotides subject to mutation and selection. 
Let us assume that we have a population of s possible polynucleotide templates Ij^j = 1, . . . ,s; to 
define the model we have to assign to each Ij its synthesization rate Aj, its degradation rate Dj, 
and the probabilities Qij of obtaining the polynucleotide li as a result of an inexact replication of a 
different polynucleodite Ij. The transition probabilities Qij have to satisfy the constraint: 

tQii = i (1) 

i=l 

meaning that the set of s polynucleotide templates li is closed under the replication process (that is 
no new polynucleotides can emerge as a result of an inexact replication). Eigen's equations are a set 
of differential equations governing the evolution of the number Xi of the self-replicating molecule 1^: 

^i = jl i^j Qij - Dj Sij )xj-^/£{Aj- Dj )xj (2) 

where 

According to ([2]) the variation of Xi is composed of a term AiQuXi taking into account the successful 
replication of the polynucleotide 7^, minus a term DiXi due to its degradation rate, plus a term 
Y^i^j Aj Qij Xj giving the number of li generated through an inexact replication of other polynu- 
cleotides, and finally minus a quadratic selection term that keeps the total population constant. In 
fact, thanks to this last term Eigen's equations satisfy 

N = j^x, = 

i=l 

Even if equation ([2]) was developed to model polynucleotides, it is more general and can be 
applied to any kind of self-replicating entities subject to mutation, under the constraint of constant 
population. Hence, from now on we will use the more generic term "species" instead of "polynu- 
cleotide" . 

In this paper we will consider in detail the case of two species 5 = 2, so that we recall some results 
coming from equation ([2]) in this case. In the two species case we can write the Eigen equations in 
terms of the "quality factors" Qn and Q22 only. Posing Qn = Qi, Q21 = I- Qi, Q22 = Q2, Q12 = 
I-Q2, equations ([2|) read: 

XI = {AiQi-Di)xi+A2{l-Q2)x2-^[{Ai-Di)xi + {A2-D2)x2] (3) 
X2 = Ai(l-Qi)xi + (A2Q2-D2)x2-^[(Ai-Di)xi + (A2-D2)x2] (4) 

We will assume that Ai - Di > A2 - D2^ so that xi represents the number of individuals with 
the highest reproductive capability. Such individuals are usually said to belong to the "master 
sequence" (because they correspond to the best polynucleotides sequence) or also to the "wild type" 
(because for low choices of the mutation probabilities they represent the most common type inside 
the population). 
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The system (|3l4p can be linearized through the fohowing time dependent change of variables (see 

El, 0, my- 



Xl{t) 
X2(t) 



exp (--^ £(Ai - Di)xi(t) + (A2 - D2)x2(T)dT^ zi(t) 

exp (-^ J\Ai - Di)xi(t) + (A2 - D2)x2(T)dT^ Z2(t) 



in the new variables equations (01) become: 

zi = (AiQi - Di) zi + A2il - Q2)z2 

Z2 = Ai(l-Qi)zi + (A2Q2-D2)Z2 

We are interested in the asymptotic master sequence concentration, that is in the steady state 
of the ratio xi(t)/{xi(t) + X2(t)). It can be determined by finding the eigenvalues and eigenvectors 
of the matrix: 



W 



( AiQi-Di 
\ ^i(l-Qi) 



A2(l-Q2) 
) ^2^2- £'2 



and is given explicitely by: 



lim 



xiit) 



lim 



Zi(t) 



t^oo Xi{t) + X2{t) t^oo Zi{t) + Z2{t) 
W12 



W12 + K- Wii 
A- - 1^22 

W21 + A_ - W22 
where A± are the eigenvalues of the matrix W: 



if A+ > A_ 
if A+ < A_ 



Wii > W22 
Wii < W22 



(5) 



A± = ^ [(W^ll + T^22) ± ^iWu-W22y+4Wi2W2l] 

In figure[T]we plot ■CbCQij Q2) for A2 = l,Di = 1, D2 = 1 and two choices of Ai: Ai = 4 and Ai = 10. 

We notice that the denominators in formula ^ vanish respectively when W21 = and when W12 = 
0. In the first case we will have Qi = 1 from which it follows A+ = Wn = Ai - Di > A2Q2 - D2 = \-- 
So using equation dSj we get: 

= 1 for Qi = 1 

In the second case we will have Q2 = 1 from which it follows Wn = \+ 
and substituting into equation (0) 



AiQi-Di, A_ = ^2 --D2 



Wn - W22 ^ ^A2-D2+Di 
if Qi > - 



W21 + Wu 







if Qi< 



Ai 
D2 + D1 



for Q2 = 1 



(6) 



Ai 



This last case is of particular interest. Indeed, Q2 - ^ means that the probability of back mutation 
from mutants to wild-type is zero and this means, in turn, that the state in which the population 
is formed of only mutants is an absorbing state for the process; that is, once it is reached it cannot 
be left. If we think of mutation as a probabilistic process instead of a deterministic one, then, if the 
population is finite, there is always a finite (possibly very small) probability to reach the absorbing 
state by the simultaneous mutation of all wild type individuals. Since a finite probability event 
will always happen provided we wait long enough, the only possible steady state when Q2 = ^ and 
population is finite should he = 0. From a more biological point of view, this is due to the fact 
that there is no genetic drift in the deterministic model. Any stochastic version of the Eigen model 
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will introduce the genetic drift in the process, and the drift will always push the system in its only 
stable state, that is the exctinction of the master sequence. In conclusion, the difference between 
the asymptotic master sequence concentration calculated through the Eigen model and through any 
stochastic version of it will always be 

— -— ifQi> , Q2 = l 

W21 + Wii Ai 

independent of the population size. Despite of this fact the Eigen equations are frequently used also 
in this case (see [9] for example). The fact that the deterministic nature of the Eigen model makes 
it unfit to correctly predict the asymptotic master sequence concentration is of particular evidence 
in the case we just illustrated and it was already noticed in ^ and by Eigen himself [4]. However, 
this observation raise the natural question of whether there exist other choices of the mutation 
probabilities such that the deterministic nature of the Eigen model leads to wrong predictions. To 
answer this question a possible strategy consists in: 

• developing a reliable stochastic version of Eigen model (given in section 3), 

• using it to compute the asymptotic master sequence concentrations for a large number of 
possible choices of the mutation probabilities (we explain how this is done for the two species 
case in section 4 and 5). 

• comparing the Eigen model and the stochastic model predictions (that is done in section 5). 



3 A stochastic version of the Eigen model 

In this section we define a discrete time stochastic model of replication with mutation subject to 
the constraint that the population is constant and having the Eigen model as its deterministic 
continuous time limit. Let h denote the time step; at each time step two stochastic processes occur: 
reproduction and selection. We introduce two variables sets x^'^^ and y^'^\ z = 1, . . . , s to describe the 
two processes, x^*^ denotes the number of individuals at the beginning of the n-th time step 

before reproduction and selection, while denotes the number of /^*^ individuals at the n-th time 
step after reproduction but before selection. In the following we explain in detail the two processes, 
while a graphical sketch of them is given in figure O First each specie undergoes reproduction and 
degradation with synthesization and degradation rate Ai and Di respectively. Each 7^*^ individual 
from the parent generation has a probability hDi of being removed from the population, while each 
of the newly synthesized I^'^^ can mutate into the specie I^^^ with a probability Qji. Hence, at time 
step n, the probability that k of the hAjx\^^ (assuming hAjX^^l^ being an integer number) newly 
synthesized I^^^ templates mutate into I^^^ ones is given by: 

p[/(i) t 7(^)] = ^ ^^i^nl j Qf.(l _ Q^^f^^-'il-^ 

The deterministic limit (that is, the limit of infinite population) of this process is obtained by 
considering the evolution of the expectation value instead of the probability distribution. Let us 
denote this expectation value by then: 

= ^^l^ht{A,Q,,-DAj)x^l 

The second phase consists in the random extraction without replacement of N specimens from 
the total population, so that, after this step, the total population will always be composed of N 
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individuals. The probability of extracting k individuals out of the y^^ of species I^^^ is given by the 
hypergeometric distribution: 



^(n+1) 



J = l 

V Vnh I 



that has mean: 



^(0 

'{n+l)h 



^(n+l)/i 



^(0 



fc=0 ^ ' Yfj=iynh 

The last equality states the intuitive fact that the random extraction of TV specimens, on the average, 
does not change the relative concentration of the templates: 



(7) 



X 



(i) 

{n+l)h 



Obviously from equation ([7j) it follows at once that 

Z— / {n+l)h 
i=l 

Hence, in the deterministic limit, the model is described by the following discrete equation: 



-(i) 
N 



(0 



U) 



Si) 



ill). - ^il _ j^^^^Q^^ _ ^^^^^^^ _ ^^^^ _ ^^^^u) ^ 0(h) 



J = l 



J = l 



and taking the limit h ^ we recover the Eigen equations ([2]). 



4 Transition matrix for the two species case 

In this section we write down the transition matrix for our Markov Chain model in the case of two 
species, 5 = 2. For a simple treatment of Markov Chains and transition matrices we refer the reader 
to the book by Grimstead and Snell [5j. 

The choice of the time step h is completely arbitrary, however, since we are interested in an 
asymptotic quantity, in this limit its value should not be particularly relevant, so we fix it to be 
1 here. We also restrict to the case Di = D2 = 1, that is the case when parents never pass to the 
following generation. This case is much simpler to treat since we have only a stochastic process in 
the reproductive phase instead of two. Furthermore, we assume Ai > A2, so that specie is the 
master sequence. Let us write a transition matrix for the reproductive step as described in section 
3. It will be a matrix P of dimension (A^ + 1) x (Ai7V + l), the Pij entries gives the probability that 

xl=N-i + l ^ yl^=Ai{N-i + l) + A2{i + l)-j + l 
Under the assumptions that we made the entries of P will read: 
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r = 0,...,iV k = 0,...,AiN 
Indeed, if there would be no mutation, starting from the initial state 

xl = N-r xl = r 

we would end up with the state 

yl, = Ai{N-r) yl^A^r 

The first three factors on the right hand side of (l8|) give the probability that / among the /^^) 
individuals mutate into /^^^ individuals, meaning that 

The last three factors giving instead the probability that at the same time - k + I of the I^^^ 
individuals mutate into I^^^ individuals, meaning that 

^ Ai{N -r)-l^ C1{N -r)-l + A2r-k + l = Ai{N -r)+A2r-k 

y'^ ^ A2r + / ^ A2r + / - A2r -\-k-l = k 

To avoid a cumbersome notation we let / and k vary from to AiN, exploiting the fact that the 
generalized binomial coefficient 

r(7V + l) 



r{k^i)r{N-k) 

is when k < or k > N. The real intervals of variation of k and / (avoiding factorials of negative 
numbers) would be: 

k = 0,...,Ai{N-r)+ A2r; I = max(0, k - A2r), . . . , min(A;, Ai {N -r)) 

The transition matrix P allows one to compute the probabilities of all the possible outcomes of 
the reproductive phase. For the selection phase, we have to define TV + 1 transition matrices R^'^^ 
of dimensions {AiN + 1) x [N + 1). In fact, depending on the state before the reproductive phase, 
we can have + 1 different values Ti = Ai{N - i + 1) + A2{i - 1) of the total population before the 
selection phase. The entries R^^^j^ gives the probabilities that starting from a total population of 
Ti = Ai{N - i + 1) + A2{i - 1) individuals with composition 

yi = A,{N-i + l) + A2{i-l)-j + l 

yl = i - 1 

we randomly extract a population composed of 

xi.i = N-k + 1 

~ k -1 

That is, using the hypergeometric distribution: 



^L\,r+i k-r ){ r ) /[ k ) 



/c = 0,...,T, r = 0,...,7V 

We can now put together the two steps to obtain the transition matrix M for our stochastic 
model by simply multiplying properly the matrices P and R^'^^: 

AiN+l 

\j 

k=l 

In the appendix we show that, as required for transition matrices, it holds 

N+l 

E Mij = 1 (9) 



Mij= ^ Pifei?«, i,i = l,...,iV + l 
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5 Results 



Depending on the values of (5i,Q2, the Markov chain model belongs to different classes. For < 
(51,(52 < 1 and ((5i,(52) ^ (O7O) the Markov chain is "ergodic". In such case, whatever the initial 
(row) probability vector v be, the system will tend to the unique fixed (row) probability vector of 
M: 

/ wM = w (10) 



lim vM"^ 



the i-th component of w gives the probability that the system will end up in the i-th state, that is 
the probability that 

xl^ = N-i + 1 
= i-l 

and we can calculate the asymptotic master sequence concentration by the simple formula 



N 



(11) 



If (5i = 1, (52 ^ 1 then the Markov chain is "absorbing" with x\ - N being its only absorbing state. In 
such case, whatever the initial state be, the final state will always be the absorbing one. Therefore, 
we will have = TV and = 1- Analogously, if (5i ^ 1,^2 = 1 the absorbing state will be xf = 
and we will have = 0. In the case ((5i,(52) = (l^l) the Markov chain will be again absorbing 
but now with two absorbing states: x] = N^xj = 0. However, when A2 = 1, as in the cases that we 
considered, M will be of the form: 



M = 



1 

Mat,! 




\ 

Mn,n 
1 / 



that is the state xj = can be reached only from xj_i = 0. Therefore, provided that ^ 0, we will 
have = 1. 

The last case we have to consider is ((5i,(52) = (0,0). In this case defines an absorbing 
Markov chain with the two absorbing states xj - N^x\ =0. Since M is of the form 



/ 

M2,i M2,2 



M 



1 







it will alternate among this two states, so that the two possible outcomes will be (generally with 
different probabilities): 



lim = 



lim xl^^^ = 

n^oo 



or 



In this case we decided to set 



lim xl^ = hm xl^_^_l = N 



M 



2N 



lim 



2n 



+ X 



2n+l 



In the cases when the Markov chain is ergodic, we found numerically, using formula (p!Q|) . the fixed 
probability vector w. We report in figure [3] the graphs of the difference among Ce(Qi, Q2) calculated 
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through Eigen equation and Cm (Q 1,^2) calculated through the Markov chain model for N = 50^ 
A2 = Di = D2 = 1 and Qi, Q2 varying among and 1, for the two choices Ai = 4 (a) and Ai = 10 (b). 

From the picture we see that, already for a low choice of TV as = 50, the Eigen model and the 
Markov chain model predictions are in a very good agreement, except near the origin (Qi,Q2) = (0,0) 
and when Q2 = ^ and Qi > l/^i- 

In figure [4] (for Ai = 4) and [5] (for Ai = 10) we report the difference - Cm when Qi e 
[0.001,0.999], Q2 e [0.001,0.999] for the different population values A/" = 2 (a). A/" = 10 (b), TV = 50 
(c), N = 100 (d). Both figures show the same phenomena. We clearly see that the agreement between 

and Cm improves by increasing and at the same time the maximum difference between ^e 
and Cm for Q2 = 0.999 moves toward the error threshold Qi = l/Ai. The difference between C^e and 
Cm near the point (0,0) decreases more slowly with increasing however it becomes considerably 
steeper. 

When Q2 = ^ the transition matrix has the only absorbing state Cm = 0, so that the only possible 
outcome is the extinction of species 1. In this case it is interesting to calculate the time to absorption, 
that is the expected number of generations before species 1 becomes extinguished. It can be obtained 
by the transition matrix through the following procedure. A reduced matrix S is defined from M 
by removing the last row and last column (corresponding to the only absorbing state). The matrix 
I - where / denotes the identity, is invertible. Let c be a column vector of dimension all of 
whose entries are 1, then the expected number of generations before the species 1 becomes extinct, 
when we start by the initial state = - i + 1, is given by the i-th entry of the column vector 

We used these relations to find computationally the expected time of extinction r for some choices 
of the mutation probability Qi when the initial state is = N. We report the results in figure [6] for 
Ai = 2 (a) and Ai = 4 (b). The choice Ai = 10 was ruled out for computational time reasons. From 
figure [6] we can see that, in the proximity of the error threshold the expected extinction time starts 
to grow very fast. Moreover, the steepness of the curve increase by increasing the total population. 

6 Discussion and conclusions 

The standard reference in the literature for the finite population case is the Nowak and Schuster 
[8] paper. In that paper, the authors introduced a stochastic birth and death model approximating 
the Eigen model in the single peak landscape case. We have used the Nowak and Schuster birth 
and death model [8j to compute the asymptotic master sequence concentration ^{Qi^Q2) in the 
two species case (actually, in the original model the mutation probability 1 - Qi and 1 - Q2 are 
supposed to be equal, however the extension to independent mutation probabilities is trivial). In 
figures [3 and [8] we plot the differences among C^e calculated with Eigen's model and ^bd calculated 
with the Nowak and Schuster model. The choice of the parameters is the same as in figures |4] and [5] 
respectively. By comparing figures [71 [8] with figures [H O we see that both stochastic models deviate 
from the Eigen model near the line Q2 = ^- The birth and death model deviations, in this region, 
are slightly larger then those of the Markov chain model. On the other hand, the birth and death 
model do not evidences any problem near the point Qi =0^Q2 =0. 

We recall that, since the Eigen model formulation, it has always been known that its deterministic 
nature confined it to threat exactly only the infinite population case. So that, to cope with the finite 
population case a stochastic version of Eigen model is needed. However, to the best of our knowledge, 
all the stochastic versions of the Eigen model proposed in the literature contain some simplifying 
assumptions ([1], [2], [3], [8], [E])- Let us stress that this is not the case of the stochastic model we 
have proposed in the present paper. 

The fact that, as we have shown, do exist choices of the mutation probabilities such that the 
Eigen model predictions are not satisfactory, nor for large values of the total population A", pushed 
us to perform a comparison between the two species asymptotic master sequence concentration 
predicted by the Eigen model and by the one we introduced. The results show that the only critical 
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regions are those near the point Qi = 0, Q2 = and near the segment Et < Qi < 1^ Q2 = where Et 
denotes the error threshold. Far from these critical regions the agreement between the Eigen and 
the stochastic model is very good also for low values of the total population, despite the fact that 
the Eigen model is obtained by the stochastic model through two subsequent approximations: the 
deterministic and the continuous time limit. Let us stress that the two species case with Q2 = I is 
a relevant limit case of the Eigen model (see for example [9]. Indeed, such case is obtained in the 
following situation. Let us suppose that our species are encoded by polynucleotides of length that 
we are in the "single peak" landscape (that is Ai - Di > A - k - D^^ A^ - Dk = Aj - Dj^ k^j > 2) 
and that there is a uniform probability of point mutation q. If one send the polynucleotides length 
u to infinity and the point mutation probability q to zero in such a way that the expected number of 
mutations p = qu remains finite, then the master sequence concentration can be found through the 
two species model. The first specie will be the master sequence and the other one will contain all the 
mutants, with the quality factors given respectively by Qi = and Q2 = 1- As we showed, in this 
case the Eigen model predictions are wrong, since the master sequence will always go exctint and 
the relevant information is given by the expected number of generations before the master sequence 
exctinction r. This quantity can be computed only through the stochastic model, not through the 
deterministic one. In figure [6l we plotted the logarithm of r, as obtained by our model, versus the 
quality factor Qi for the two choices Ai = 2, A2 = 1 (a) and Ai = 4, A2 = 1 (b). The Eigen model 
predicts that r should go to infinity when Qi > Et. While it is true that r grows steeply in the region 
Qi > Et and that the steepness increase with the total population two further considerations 
are in order. First, the steepness is far less when Ai = 4 (see figure [6lb) than when Ai - 2 (figure 
[6la). This suggest that, if Ai » ^2, the master sequence could go extinct for Qi > in a small 
number of generations, also for high values of the total population N. Second, the value of r at 
the error threshold is higher when Ai = 2 (figure [6la) than when Ai = 4 (figure [Hb). This suggest 
that if Ai ^ ^2, then, even if Qi < a large number of generations could be necessary before the 
master sequence go extinct, especially if the total population N is large. So that one has to be quite 
cautious in using the Eigen model predictions in this limit case. 

Finally, we would like to mention that writing the transition matrix of our stochastic model for 
a given number of species greater than 2 present no conceptual difficulties. However, the number of 
states appearing in the transition matrix is equal to the number of partitions of N + s into s classes. 
This means that already for s = 3, for example, the number of states is equal to {N + 1)(N + 2)/2 
and numerical computation with the transition matrix are affordable only for low values of N. 
Henceforth, to deal with the s > 2 case one is forced to introduce some approximation of the kind 
used in [8]. An alternative approach would be to use computer simulations, and since we have defined 
a completely discrete model, its implementation for computer simulations is really straightforward. 
Moreover, simulating our model is computationally more economic than using the computer model 
introduced by Nowak and Schuster [8]. 

Appendix 

In this appendix we show that equation (|9]) does indeed hold. Note that the Vandermonde identity 



implies 



N+l 



(12) 



Let us now show that it also holds 



AiN 



(13) 
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We have: 

Let us consider a fixed value for / in the second summation. If the sum on k is such that k - 1 spans 
the interval 0, . . . , A2r, then such sum is equal to 1: 

Note that - / = is for sure in the interval of summation over /c, since k has the same interval of 
variation of /. On the other hand let us notice that the maximum value for / is Ai{N - r) since for 
greater values of / the binomial factor 

Ai(N-r) j 

is always zero. Hence k-l = will be in the interval of summation over k if AiN - Ai{N -r) > A2r, 
but this is granted by the condition Ai > A2. The sum over / will now give 1 for exactly the same 
reasons. 

From equations (fT3|) and (p!2|) we get: 

N+l N+lAiN+1 AiN+1 N+1 

j=l j=l k=l k=l j=l 
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Xnh Reproduction i/nh Selection X{n+l)h 




Figure 2: An example of a time step of the stochastic model defined in section 3 for 5 = 3, ♦ = 
•= ■ = hAi = 3, hA2 = 1, hAs = 1. Each individual /^*^ in the parent generation give raise 
to hAi new individuals, each of them having a probability Qji of mutating into an I^^^ one. The 
parent individual has a probability hDi of degradating before the selection phase. In the selection 
phase N individuals are extracted at random (without replacement) from the pool of individuals 
obtained after the reproduction phase. The event probability is associated to each arrow. Notice 
that the probability of any individual of surviving the selection phase is independent on the specie. 
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Figure 4: The difference - as a function ( 
(a), AT = 10 (6), AT = 50 (c) and TV = 100 (d). 
covering the square Qi e [0.001,0.999], Q2 e [0.( 




(d) 



f (Qi,Q2), for Ai = 4, = L)i = L)2 = 1 and AT = 2 
The graphs are drawn using a grid of 625 points 
)1, 0.999]. 
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Figure 5: The difference - as a function of (Qi, Q2), for Ai = 10, A2 = Di = D2 = 1 and N = 2 
(a), N = 10 (b)^ N = 50 (c) and N = 100 (d). The graphs are drawn using a grid of 625 points 
covering the square Qi e [0.001,0.999], Q2 e [0.001,0.999]. 
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Figure 6: Logarithm of expected generation of extinction for the master sequence as a function of Qi 
when Q2 = The curves given correspond (from right to left) to total population N = 50, N = 100, 
, N = 300 and N = 400. Left graph Ai = 2, right graph Ai = 4. Dashed lines correspond to 
the Eigen error threshold. 
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Figure 7: The difference - £,bd as a function of (Qi, Q2), for Ai = 4^ A2 = Di = D2 = 1 and N = 2 
(a), N = 10 (b)^ N = 50 (c) and N = 100 (d). The graphs are drawn using a grid of 625 points 
covering the square Qi e [0.001,0.999], Q2 e [0.001,0.999]. 
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Figure 8: The difference £,e-£,bd as a function of (Qi, Q2), for Ai = 10, A2 = Di = D2 = 1 and N = 2 
(a), N = 10 (b)^ N = 50 (c) and N = 100 (d). The graphs are drawn using a grid of 625 points 
covering the square Qi e [0.001,0.999], Q2 e [0.001,0.999]. 
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